Classification of Scientific Publications using Swarm Intelligence
نویسندگان
چکیده
Document classification is an important task in data mining. Currently, identifying category (i.e., topic) of a scientific publication is a manual task. The Association for Computing Machinery Computing Classification System (ACM CCS) is most wildly used multi-level taxonomy for scientific document classification. Correct classification becomes difficult with an increase in number of levels as well as in number of categories. Domain overlapping aggravates this problem as a publication may belong to multiple domains. Thus manual classification to taxonomy becomes more difficult. Most of the existing text classification schemes are based on the Term Frequency and Inverse Document Frequency (TF-IDF) technique. Similar approaches become computationally inefficient for large datasets. Most of the techniques for text classification are not experimentally validated on scientific publication datasets. Also, multi-level and multi-class classification is missing in most of the existing schemes for document classification. The proposed approach is based on metadata (i.e., structural representation), in which only the title and keywords are considered. We reduced the features set by dropping some of the metadata, like abstract section of the scientific publication that diversifies the result accuracy. The proposed solution was inspired from the well-known evolutionary Particle Swarm Optimization (PSO). The proposed technique results in overall 84.71% accuracy on Journal of Universal Computer Science (J.UCS) dataset.
منابع مشابه
S3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization
Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...
متن کاملGPU-Accelerated Data Mining with Swarm Intelligence
Swarm intelligence describes the ability of groups of social animals and insects to exhibit highly organized and complex problem-solving behaviors that allow the group as a whole to accomplish tasks which are beyond the capabilities of any one of the constituent individuals. This natural phenomenon is the inspiration for swarm intelligence systems, a class of algorithms that utilizes the emerge...
متن کاملRemote Image Classification Using Particle Swarm Optimization
In order to have clarity in the satellite images we have used Particle Swarm Optimization technique. When incorporated with traditional clustering algorithms, problems such as local optima and sensitivity to initialization, are reduced, thus exploring a greater area using global search. This segmented image is further classified using Kappa coefficient. Keywords— Particle Swarm Optimization(PSO...
متن کاملSoft Computing Methods based on Fuzzy, Evolutionary and Swarm Intelligence for Analysis of Digital Mammography Images for Diagnosis of Breast Tumors
Soft computing models based on intelligent fuzzy systems have the capability of managing uncertainty in the image based practices of disease. Analysis of the breast tumors and their classification is critical for early diagnosis of breast cancer as a common cancer with a high mortality rate between women all around the world. Soft computing models based on fuzzy and evolutionary algorithms play...
متن کاملAnalysis of Scientific Publications in the Field of Ethics in Accounting
Background: Scientific articles represent the efforts of researchers and are useful and valuable source of information and can be taken as a basis for scientific and performance analysis. The purpose of this research is to study the scientific production of the subject area of ethics in accounting. Method: This descriptive-analytical research examined 145 articles of the subject area of ethics ...
متن کامل